Multiple linear regression for protein secondary structure prediction.

نویسنده

  • X M Pan
چکیده

In the present work, a novel method was proposed for prediction of secondary structure. Over a database of 396 proteins (CB396) with a three-state-defining secondary structure, this method with jackknife procedure achieved an accuracy of 68.8% and SOV score of 71.4% using single sequence and an accuracy of 73.7% and SOV score of 77.3% using multiple sequence alignments. Combination of this method with DSC, PHD, PREDATOR, and NNSSP gives Q3 = 76.2% and SOV = 79.8%.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Protein Secondary Structure Prediction: a Literature Review with Focus on Machine Learning Approaches

DNA sequence, containing all genetic traits is not a functional entity. Instead, it transfers to protein sequences by transcription and translation processes. This protein sequence takes on a 3D structure later, which is a functional unit and can manage biological interactions using the information encoded in DNA. Every life process one can figure is undertaken by proteins with specific functio...

متن کامل

Predicting protein secondary structure and solvent accessibility with an improved multiple linear regression method.

We have improved the multiple linear regression (MLR) algorithm for protein secondary structure prediction by combining it with the evolutionary information provided by multiple sequence alignment of PSI-BLAST. On the CB513 dataset, the three states average overall per-residue accuracy, Q(3), reached 76.4%, while segment overlap accuracy, SOV99, reached 73.2%, using a rigorous jackknife procedu...

متن کامل

QSAR Prediction of Half-Life, Nondimentional Eeffective Degradation Rate Constant and Effective Péclet Number of Volatile Organic Compounds

In this work some quantitative structure activity relationship models were developed for prediction of three bioenvironmental parameters of 28 volatile organic compounds, which are used in assessing the behavior of pollutants in soil. These parameters are; half-life, non dimensional effective degradation rate constant and effective Péclet number in two type of soil. The most effective descripto...

متن کامل

Determination of standardized prececal protein digestibility of canola meal in British United Turkeys Big 6 at different ages using multiple linear regression procedure

ABSTRACT- Ninety six 2-week-old British United Turkeys (BUT) Big 6 were assigned to 2 groups (4 replicates of 12 birds each). One basal diet and another one in which 20% canola meal replaced the corn starch in the basal diet were prepared. At 9, 16 and 21 wk of age, four birds from each pen slaughtered for digesta collection. The last two-third segment of the intestine, between the Meckel’s div...

متن کامل

Application of Linear Regression and Artificial NeuralNetwork for Broiler Chicken Growth Performance Prediction

This study was conducted to investigate the prediction of growth performance using linear regression and artificial neural network (ANN) in broiler chicken. Artificial neural networks (ANNs) are powerful tools for modeling systems in a wide range of applications. The ANN model with a back propagation algorithm successfully learned the relationship between the inputs of metabolizable energy (kca...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Proteins

دوره 43 3  شماره 

صفحات  -

تاریخ انتشار 2001